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Executive Summary 



The 2007 Trends in International Mathematics and Science 
Study (TIMSS) is the fourth administration since 1995 of this 
international comparison. Developed and implemented at 
the international level by the International Association for 
the Evaluation of Educational Achievement (IEA) — an 
international organization of national research institutions and 
governmental research agencies — -TIMSS is used to measure 
overtime the mathematics and science knowledge and skills 
of fourth- and eighth-graders. TIMSS is designed to align 
broadly with mathematics and science curricula in the 
participating countries. 

This report focuses on the performance of U.S. students 
relative to that of their peers in other countries in 2007, 
and on changes in mathematics and science achievement 
since 1995. 1 Thirty-six countries or educational jurisdictions 
participated at grade four in 2007, while 48 participated at 
grade eight. 2 This report also describes additional details 
about the achievement of U.S. student subpopulations. 

All differences described in this report are statistically 
significant at the .05 level. No statistical adjustments 
to account for multiple comparisons were used. 

Key findings from the report include the following: 

• In 2007, the average mathematics scores of both U.S. 
fourth-graders (529) and eighth-graders (508) were higher 
than the TIMSS scale average (500 at both grades). 3 
The average U.S. fourth-grade mathematics score 
was higher than those of students in 23 of the 35 other 
countries, lower than those in 8 countries (all located in 
Asia or Europe), and not measurably different from those 
in the remaining 4 countries. 4 At eighth grade, the average 
U.S. mathematics score was higher than those of students 
in 37 of the 47 other countries, lower than those in 5 
countries (all of them located in Asia), and not measurably 
different from those in the other 5 countries. 



• Compared to 1995, the average mathematics scores for 
both U.S. fourth- and eighth-grade students were higher in 
2007. At fourth grade, the U.S. average score in 2007 was 
529, 11 points higher than the 1995 average of 518. At 
eighth grade, the U.S. average mathematics score in 2007 
was 508, 16 points higher than the 1995 average of 492. 

• In 2007, 10 percent of U.S. fourth-graders and 6 percent 
of U.S. eighth-graders scored at or above the advanced 
international benchmark in mathematics. 5 At grade four, 
seven countries had higher percentages of students 
performing at or above the advanced international 
mathematics benchmark than the United States: 
Singapore, Hong Kong SAR, Chinese Taipei, Japan, 
Kazakhstan, England, and the Russian Federation. 
Fourth-graders in these seven countries were also found 
to outperform U.S. fourth-graders, on average, on the 
overall mathematics scale. At grade eight, a slightly 
different set of seven countries had higher percentages 
of students performing at or above the advanced 
mathematics benchmark than the United States: Chinese 
Taipei, Korea, Singapore, Hong Kong SAR, Japan, 
Hungary, and the Russian Federation. These seven 
countries include the five countries that had higher 
average overall mathematics scores than the United 
States, as well as Hungary and the Russian Federation. 

• In 2007, the average science scores of both U.S. fourth- 
graders (539) and eighth-graders (520) were higher 
than the TIMSS scale average (500 at both grades). 

The average U.S. fourth-grade science score was higher 
than those of students in 25 of the 35 other countries, 
lower than those in 4 countries (all of them in Asia), 
and not measurably different from those in the remaining 
6 countries. At eighth grade, the average U.S. science 
score was higher than the average scores of students 
in 35 of the 47 other countries, lower than those in 9 
countries (all located in Asia or Europe), and not 
measurably different from those in the other 3 countries. 



1 At grade four, a total of 257 schools and 10,350 students participated in the United States in 2007. At grade eight, 239 schools and 9,723 students participated. 
The overall weighted school response rate in the United States was 70 percent at grade four before the use of substitute schools. The final weighted student 
response rate at grade four was 95 percent. At grade eight, the overall weighted school response rate before the use of substitute schools was 68 percent. 

The final weighted student response rate at grade eight was 93 percent. 

2 The total number of countries reported here differs from the total number reported in the international TIMSS reports (Mullis et al. 2008; Martin et al. 2008). 

In addition to the 36 countries at grade four and 48 countries at grade eight, 8 other educational jurisdictions, or “benchmarking” entities, participated: the states 
of Massachusetts and Minnesota; the Canadian provinces of Alberta, British Columbia, Ontario, and Quebec; Dubai, United Arab Emirates; and the Basque region 
of Spain. 

3 TIMSS provides two overall scales — mathematics and science — as well as several content and cognitive domain subscales for each of the overall scales. 

The scores are reported on a scale from 0 to 1 ,000, with the TIMSS scale average set at 500 and standard deviation set at 100. 

4 TIMSS is open to countries and subnational entities, or educational jurisdictions, which are part of larger countries. For example, Hong Kong is a Special 
Administrative Region (SAR) of the People’s Republic of China. For convenience, this report uses the term “country” or “nation" to refer to all participating entities. 

5 TIMSS reports on four benchmarks to describe student performance in mathematics and science. Each benchmark is associated with a score on the achievement 
scale and a description of the knowledge and skills demonstrated by students at that level of achievement. The advanced international benchmark indicates that 
students scored 625 or higher. More information on the benchmarks can be found in the main body of the report and appendix A. 
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• The average science scores for both U.S. fourth- and 
eighth-grade students in 2007 were not measurably 
different from those in 1995. The U.S. fourth-grade 
average science score in 2007 was 539 and in 1995 
was 542. The U.S. eighth-grade average science score 
in 2007 was 520 and in 1995 was 513. 

• In 2007, 15 percent of U.S. fourth-graders and 10 percent 
of U.S. eighth-graders scored at or above the advanced 
international benchmark in science. At grade four, two 
countries had higher percentages of students performing 
at or above the advanced international science 
benchmark than the United States: Singapore and 
Chinese Taipei. Fourth-graders in these two countries 
were also found to outperform U.S. fourth-graders, on 
average, on the overall science scale. At grade eight, six 
countries had higher percentages of students performing 
at or above the advanced science benchmark than the 
United States: Singapore, Chinese Taipei, Japan, 
England, Korea, and Hungary. These six countries also 
had higher average overall eighth-grade science scores 
than the United States. 
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Introduction 

TIMSS in brief 

The Trends in International Mathematics and Science 
Study (TIMSS) 2007 is the fourth time since 1995 that this 
international comparison of student achievement has been 
conducted. Developed and implemented at the international 
level by the International Association for the Evaluation of 
Educational Achievement (IEA), an international organization 
of national research institutions and governmental research 
agencies, TIMSS is used to measure over time the 
mathematics and science knowledge and skills of fourth- 
and eighth-graders. 

TIMSS is designed to align broadly with mathematics and 
science curricula in the participating countries. The results, 
therefore, suggest the degree to which students have learned 
mathematics and science concepts and skills likely to have 
been taught in school. TIMSS also collects background 
information on students, teachers, and schools to allow 
cross-national comparison of educational contexts that may 
be related to student achievement. In 2007, there were 58 
countries and educational jurisdictions 1 that participated 
in TIMSS, at the fourth- or eighth-grade level, or both. 2 

This report presents the performance of U.S. students 
relative to their peers in other countries, and on changes in 
mathematics and science achievement since 1995. Most of 
the findings in the report are based on the results presented 
in two reports published by the IEA and available online at 
http://www.timss.ora : 

• TIMSS 2007 International Mathematics Report: 
Findings From lEA’s Trends in International Mathematics 
and Science Study at the Fourth and Eighth Grades 
(Mullis et al. 2008); and 

• TIMSS 2007 International Science Report: Findings 
From lEA’s Trends in International Mathematics 
and Science Study at the Fourth and Eighth Grades 
(Martin et al. 2008). 

For a number of participating countries, changes in 
achievement can be documented over the last 12 years, 



from 1995 to 2007. For other countries, changes can be 
documented over a shorter period of time. Table 1 and 
figure 1 show the countries that participated in TIMSS 2007 
as well as their participation status in the earlier TIMSS data 
collections. The TIMSS fourth-grade assessment was 
implemented in 1995, 2003, and 2007, while the eighth-grade 
assessment was implemented in 1995, 1999, 2003, and 2007. 

This report describes additional details about the achievement 
of U.S. students that are not available in the international 
reports, such as trends in the achievement of students of 
different racial and ethnic and socioeconomic backgrounds. 

Design and administration of TIMSS 

TIMSS 2007 is sponsored by the IEA and carried out under 
a contract with the TIMSS & PIRLS 3 International Study Center 
at Boston College. The National Center for Education 
Statistics (NCES), in the Institute of Education Sciences 
at the U.S. Department of Education, is responsible for the 
implementation of TIMSS in the United States. Data collection 
in the United States was carried out under contract to 
Windwalker Corporation and its subcontractors, Westat 
and Pearson Educational Measurement. 

Participating countries administered TIMSS to two national 
probability samples of students and schools, based on a 
standardized definition. Countries were required to draw 
samples of students who were nearing the end of their fourth 
year or eighth year of formal schooling, beginning with the 
International Standard Classification of Education (ISCED) 
Level I. 4 In most countries, including the United States, these 
students were in the fourth and eighth grades. Details on the 
grades assessed in each country are included in appendix A. 

In the United States, TIMSS was administered between April 
and June 2007. The U.S. sample included both public and 
private schools, randomly selected and weighted to be 
representative of the nation. 5 In total, 257 schools and 10,350 
students participated at grade four, and 239 schools and 
9,723 students participated at grade eight. The overall 
weighted school response rate in the United States was 70 



^IMSS is open to countries and subnational entities, or educational jurisdictions, which are part of larger countries. For example, Hong Kong is a Special 
Administrative Region (SAR) of the People's Republic of China. For convenience, this report uses the term “country” or “nation" to refer to all participating entities. 

2 Data from two nations were judged problematic by the IEA. Morocco failed to meet the required school participation rates in grade eight because of a procedural 
difficulty with some schools. Also, the quality of the data from Mongolia was not well documented at either grade level. In the international reports, Morocco is 
included in the fourth-grade tables but is shown “below the line” in the eighth-grade tables to indicate a problem in data quality. Data on Mongolia are reported 
in an appendix. For the purposes of the present report, statistics relating to Moroccan eighth-graders and to Mongolian students in both grades are not reported. 

3 The international study center takes its name from the two main IEA studies it coordinates; the Trends in International Mathematics and Science Study (TIMSS) 
and the Progress in International Reading Literacy Study (PIRLS). 

4 The ISCED was developed by the United Nations Educational, Scientific, and Cultural Organization (UNESCO) to assist countries in providing comparable, 
cross-national data. ISCED Level 1 is termed primary schooling, and in the United States is equivalent to the first through sixth grades (Matheson et al. 1996). 

5 The sample frame data for public schools in the United States was based on the 2006 National Assessment of Educational Progress (NAEP) sampling frame. 
This was done because recruitment of districts and schools began at the end of the 2005-06 school year to maximize response rates. The 2006 NAEP sampling 
frame was based on the 2003-04 Common Core of Data (CCD), and the data for private schools were from the 2003-04 Private School Universe Survey (PSS). 
Any school containing at least one grade four or one grade eight class was included in the school sampling frame. 
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Table 1 . Participation in the TIMSS fourth- and eighth-grade assessments, by grade and country: 
1995, 1999, 2003, and 2007 



Grade four Grade eight Grade four Grade eight 



Country 


1995 


2003 


2007 


1995 


1999 


2003 


2007 


Country 


1995 


2003 


2007 


1995 


1999 


2003 


2007 


Total 


26 


25 


36 


41 


38 


46 


48 


Total 


26 


25 


36 


41 


38 


46 


48 


Algeria 






✓ 








✓ 


Korea, Rep. of 


✓ 






✓ 


✓ 


✓ 


✓ 


Armenia 




✓ 


✓ 






✓ 


✓ 


Kuwait 


✓ 




V 


V 






✓ 


Australia 1 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


Latvia 5 


✓ 


✓ 


✓ 


V 


✓ 


✓ 




Austria 


✓ 




✓ 


✓ 








Lebanon 












✓ 


V 


Bahrain 












✓ 


✓ 


Lithuania 




✓ 


V 


V 


✓ 


✓ 


V 


Belgium (Flemish) 




✓ 




✓ 


✓ 


✓ 




Macedonia, Rep. of 










✓ 


✓ 




Belgium (French) 








✓ 








Malaysia 










✓ 


✓ 


V 


Bosnia and Flerzegovina 














✓ 


Malta 














V 


Botswana 












✓ 


✓ 


Moldova, Rep. of 




✓ 






V 


✓ 




Bulgaria 








✓ 


✓ 


✓ 


✓ 


Morocco 4 




✓ 


V 




✓ 


✓ 




Canada 


✓ 






✓ 


✓ 






Netherlands 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 




Chile 










✓ 


✓ 




New Zealand 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 




Chinese Taipei 




✓ 


✓ 




✓ 


✓ 


✓ 


Norway 


✓ 


✓ 


✓ 


✓ 




✓ 


V 


Colombia 






✓ 


✓ 






✓ 


Oman 














V 


Cyprus 


✓ 


✓ 




✓ 


✓ 


✓ 


✓ 


Palestinian Nat'IAuth. 












✓ 


V 


Czech Republic 


✓ 




✓ 


✓ 


✓ 




✓ 


Philippines 




✓ 






✓ 


✓ 




Denmark 






✓ 


✓ 








Portugal 


✓ 






V 








Egypt 












✓ 


✓ 


Qatar 






✓ 








V 


El Salvador 






✓ 








✓ 


Romania 








V 


✓ 


✓ 


V 


England 2 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


Russian Federation 




✓ 


✓ 


V 


✓ 


✓ 


V 


Estonia 












✓ 




Saudi Arabia 












✓ 


V 


Finland 










✓ 






Scotland 


✓ 


✓ 


✓ 


V 




✓ 


s 


France 








✓ 








Serbia 












✓ 


V 


Georgia 






✓ 








✓ 


Singapore 


V 


✓ 


✓ 


V 


✓ 


✓ 


V 


Germany 






✓ 


✓ 








Slovak Republic 






✓ 


V 


✓ 


✓ 




Ghana 












✓ 


✓ 


Slovenia 1 


V 


✓ 


✓ 


V 


✓ 


✓ 


V 


Greece 


✓ 






✓ 








South Africa 6 








✓ 


✓ 


✓ 




Hong Kong SAR 3 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


Spain 








✓ 








Hungary 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


Sweden 






V 


✓ 




✓ 


V 


Iceland 


✓ 






✓ 








Switzerland 








✓ 








Indonesia 










✓ 


✓ 


✓ 


Syrian Arab Republic 














V 


Iran, Islamic Rep. of 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


Thailand 


V 






✓ 


✓ 




V 


Ireland 


✓ 






✓ 








Tunisia 




✓ 


V 




✓ 


✓ 


V 


Israel 4 


✓ 






✓ 


✓ 


✓ 


✓ 


Turkey 










✓ 




V 


Italy 4 




✓ 


✓ 




✓ 


✓ 


✓ 


Ukraine 






V 








V 


Japan 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


✓ 


United States 


V 


✓ 


V 


✓ 


✓ 


✓ 


V 


Jordan 










✓ 


✓ 


✓ 


Yemen 






V 











Kazakhstan ✓ 



because of national-level changes in the starting age/date for school, 1999 data for Australia and Slovenia cannot be compared to 2003 data. 

2 England collected data at grade eight in 1995, 1999, and 2003, but due to problems with meeting the minimum sampling requirements for 2003, its eighth-grade 
data are not shown in this report. 

3 Hong Kong is a Special Administrative Region (SAR) of the People’s Republic of China. 

4 Because of changes in the population tested, 1995 data for Israel and Italy, and 1999 data for Morocco are not shown. 

5 Only Latvian-speaking schools were included in 1995 and 1999. For trend analyses, only Latvian-speaking schools are included in the estimates. 

6 Because within-classroom sampling was not accounted for, 1995 data are not shown for South Africa. 

NOTE: No fourth-grade assessment was conducted in 1999. Only countries that completed the necessary steps for their data to appear in the reports from the 
International Study Center are listed. In addition to the countries listed above, eight separate jurisdictions participated in the Trends in International Mathematics 
and Science Study (TIMSS) 2007: the provinces of Alberta, British Columbia, Ontario, and Quebec in Canada; the Basque region of Spain; Dubai, UAE, and the 
states of Massachusetts and Minnesota. Information on these eight jurisdictions can be found in the international TIMSS 2007 reports. Morocco participated in 
TIMSS 2007 at both the fourth and eighth grades, but due to sampling difficulties, its grade eight data are not shown in this report. Mongolia also participated in 
TIMSS 2007 but could not complete the steps necessary to have its data included in the report. Countries could participate at either grade level. Countries were 
required to sample students enrolled in the grade corresponding to the fourth and eighth year of schooling, beginning with International Standard Classification 
of Education (ISCED) level 1 , providing that the mean age at the time of testing was at least 9.5 years and 1 3.5 years, respectively. In the United States and most 
countries, this corresponds to grade four and grade eight. See table A1 in appendix A for details. 

SOURCE: International Association for the Evaluation of Educational Achievement (IEA), Trends in International Mathematics and Science Study (TIMSS), 1995, 
1999, 2003 and 2007. 
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Figure 1 . Countries that participated in TIMSS 2007 



Norway. 



Slovak Republic 
Lithuania 

Latvia , Ukraine 

. Bulgaria 




SOURCE: International Association for the Evaluation of Educational Achievement (IEA), Trends in International Mathematics and Science Study (TIMSS), 2007. 



percent at grade four before the use of substitute schools and 
89 percent with the inclusion of substitute schools. 6 At grade 
eight, the overall weighted school response rate before the 
use of substitute schools was 68 percent and 83 percent with 
the inclusion of substitute schools. The final weighted student 
response rate at grade four was 95 percent and at grade eight 
was 93 percent. Student response rates are based on a 
combined total of students from both sampled and substitute 
schools. Detailed information on sampling, administration, 
response rates, and other technical issues are included 
in appendix A. 

Reporting TIMSS results 

Achievement results from TIMSS are reported on a scale from 
0 to 1 ,000, with a TIMSS scale average of 500 and standard 
deviation of 100. Even though the countries participating in 
TIMSS have changed across the four assessments between 



1995 and 2007, comparisons between the 2007 results and 
prior results are still possible because the achievement scores 
in each of the TIMSS assessments are placed on a scale which 
is not dependent on the list of participating countries in any 
particular year. A brief description of the assessment equating 
and scaling is presented in appendix A to this volume. A more 
detailed presentation can be found in the TIMSS 2007 
Technical Report (Olson, Martin, and Mullis 2008). 

In addition to numerical scale results, TIMSS also includes 
international benchmarks. The TIMSS international benchmarks 
provide a way to interpret the scale scores and to understand 
how students’ proficiency in mathematics and science varies 
along the TIMSS scale. The TIMSS benchmarks describe four 
levels of student achievement in each subject, based on the 
kinds of skills and knowledge students at each score outpoint 
would need to successfully answer the mathematics and 
science items. In general, the score outpoints for the TIMSS 
benchmarks were set based on the distribution of students 



6 NCES standards advise that substitute schools should not be included in the calculation of response rates (standard 1-3-8; National Center for Education 
Statistics 2002). Response rates calculated “before replacement” are consistent with this standard. Response rates calculated “after replacement" include 
substitute schools and hence are not consistent with NCES standards. Both kinds of response rates are reported here in the interests of comparability with 
the TIMSS international reports which report response rates before and after replacement. 
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along the TIMSS scale. More information on the development 
of the benchmarks and the procedures used to set the score 
outpoints can be found in the TIMSS 2007 Technical Report 
(Olson, Martin, and Mullis 2008). 

All differences described in this report are statistically 
significant at the .05 level. No statistical adjustments to 
account for multiple comparisons were used. Differences that 
are statistically significant are discussed using comparative 
terms such as “higher” and “lower.” Differences that are not 
statistically significant are either not discussed or referred to 
as “not measurably different” or “not statistically significant.” 
In this latter case, failure to find a difference as statistically 
significant does not necessarily mean that there was no 
difference. It simply means that, given the precision of the 
estimates, there is a larger than five percent chance that the 
difference was zero. In addition, because the results of tests 
of statistical significance are, in part, influenced by sample 
sizes, statistically significant results may not identify those 
findings that have policy or practical importance. For this 
reason, this report includes effect sizes to provide the reader 
with a sense of the magnitude of statistically significant 
differences. Further information about effect sizes and about 
the tests conducted to determine statistical significance can 
be found in appendix A. Supplemental tables providing all 
estimates and standard errors discussed in this report are 
available online at http://nces.ed.gov/pubsearch/pubsinfo. 
asp?pubid=2009001 . 

All data presented in this report are used to describe 
relationships between variables. These data are not intended, 
nor can they be used, to imply causality. Student performance 
can be affected by a complex mix of educational and other 
factors that are not examined here. 

Nonresponse bias in the U.S. 

TIMSS samples 

NCES standards require a nonresponse bias analysis if 
school-level response rates fall below 85 percent, as they 
did for both the fourth- and eighth-grade school samples 
in TIMSS 2007. 7 As a consequence, a nonresponse bias 
analysis was undertaken, similar to that used for TIMSS 
2003 (Ferraro and Van De Kerckhove 2006). 

These analyses examined whether the participation status 
of schools (participant/non-participant) was related to seven 
school characteristics: the region of the country in which the 
school was located (Northeast, Southeast, Central, West); 



the type of community served by the school (central city, 
urban fringe/large town, rural/small town); whether the school 
was public or private; percentage of students eligible for free 
or reduced-price lunch; number of students enrolled in fourth 
or eighth grade; total number of students; and percentage of 
students from minority backgrounds. Details are provided in 
appendix A. 8 

The findings indicate some potential for bias in the data 
arising from regional and community-type differences in 
participation, along with the fact that schools with higher 
percentages of minority students were less likely to 
participate. Specifically, grade 4 schools in the central region 
were more likely to participate than schools in the other 
regions, and schools in rural/small towns were more likely 
to participate than schools in central cities. However with 
the inclusion of substitute schools there were no measurable 
differences by region and differences by community type were 
substantially reduced. At grade 8, after substitution, the results 
of the analyses indicated that schools in central cities were 
still more likely to participate than schools in urban/fringe/large 
towns. At both grades, schools with higher percentages 
of minority students were less likely to participate, but 
the measurable differences were small after substitution 
especially at grade 8. Since TIMSS is conducted under a 
set of standard rules designed to facilitate international 
comparisons, the U.S. nonresponse bias analysis results 
were not used to adjust the U.S. data for this source of bias. 
While this may be possible at some later date, at present the 
variables identified above remain as potential sources of bias 
in the published estimates. 

Further information 

To assist the reader in understanding how TIMSS relates 
to the National Assessment of Educational Progress (NAEP), 
the primary source of national- and state-level data on U.S. 
students’ mathematics and science achievement, NCES 
compared the form and content of the TIMSS and NAEP 
mathematics and science assessments. A summary of 
the results of this comparison is included in appendix C. 
Appendix D includes a list of TIMSS publications and 
resources published by NCES and the IEA. Standard errors 
for the estimates discussed in the report are available online 
at http://nces.ed. aov/pubsearch/pubsinfo.asD?pubid= 
2009001 . Detailed information on TIMSS can also be found 
on the NCES website ( http://nces.ed.gov/timss 1 and the 
international TIMSS website ( http://www.timss.org ). 



7 Standard 2-2-2 found in National Center for Education Statistics 2002. 

8 Thefull text of the nonresponse bias analysis conducted for TIMSS 2007 will be included in a technical report released with the U.S. national dataset. 
See appendix A for a description of the analyses undertaken and additional details on the findings. 
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Mathematics Performance in the United States 
and Internationally 



The TIMSS mathematics 
assessment 

The TIMSS mathematics assessment is designed along two 
dimensions: the mathematical topics or content that students 
are expected to learn and the cognitive skills students are 
expected to have developed. The topical or content domains 
(as they are called in TIMSS) covered at grade four are 
number, geometric shapes and measures, and data display 
(table 2). At grade eight, the content domains are number, 
algebra, geometry, and data and chance. The cognitive 
domains in each grade are knowing, applying, and reasoning. 
Example items from the TIMSS mathematics assessment are 
included in appendix B (see items B1 through B7). 

The proportion of items devoted to a domain, and, therefore, 
the contribution of the domain to the overall mathematics 
scale score differs somewhat across grades. For example, 
in 2007 at grade four, 52 percent of the TIMSS mathematics 
assessment focused on the number domain, while the 
analogous percentage at grade eight was 29 percent. 

The proportion of items devoted to each cognitive domain 
was similar across grades. 



Also, within a content or cognitive domain, the makeup of 
items, in terms of difficulty and form of knowledge and skills 
addressed, differs across grade levels to reflect the nature, 
difficulty, and emphasis of the subject matter encountered in 
school at each grade. TIMSS 2007 Assessment Frameworks 
(Mullis et al. 2005) provides a more detailed description 
of the content and cognitive domains assessed in TIMSS. 

The development and validation of the cognitive domains 
is detailed in lEA’s TIMSS 2003 International Report on 
Achievement in the Mathematics Cognitive Domains: 
Findings From a Developmental Project (Mullis, Martin, 
and Foy 2005). 

TIMSS provides an overall mathematics scale score as well 
as content and cognitive domain scores at each grade level. 
The TIMSS mathematics scale is from 0 to 1,000 and the 
international mean score is set at 500, with a standard deviation 
of 100. The scaling of data is conducted separately for each 
grade and each content domain. Thus, a score of 500 on the 
grade four scale is not equivalent to a score of 500 on the 
grade eight scale The scaling of data is conducted separately 
for each grade and each content domain. While the scales 
were created to each have a mean of 500 and a standard 
deviation of 100, the subject matter and the level of difficulty 
of items necessarily differ between the assessments at both 
grades. Therefore, direct comparisons between scores across 
grades should not be made. See appendix A for more details. 



Table 2. Percentage of fourth- and eighth-grade TIMSS mathematics assessment 
devoted to content and cognitive domains: 2007 



Grade four 




Grade eight 






Percent of 




Percent of 


Content domains 


assessment 


Content domains 


assessment 


Number 


52 


Number 


29 


Geometric shapes and measures 


34 


Algebra 


30 


Data display 


15 


Geometry 


22 






Data and chance 


19 




Percent of 




Percent of 


Cognitive domains 


assessment 


Cognitive domains 


assessment 


Knowing 


39 


Knowing 


38 


Applying 


39 


Applying 


41 


Reasoning 


22 


Reasoning 


21 



NOTE: The content and cognitive domains are the foundation of the Trends in International Mathematics and Science Study (TIMSS) 
assessment. The content domains define the specific mathematics subject matter covered by the assessment, and the cognitive 
domains define the sets of behaviors expected of students as they engage with the mathematics content. Each mathematics content 
domain has several topic areas. Each topic area is presented as a list of objectives covered in a majority of participating countries, 
at either grade four or grade eight. However, the cognitive domains of mathematics are defined by the same three sets of expected 
behaviors — knowing, applying, and reasoning. Detail may not sum to totals because of rounding. 

SOURCE: International Association for the Evaluation of Educational Achievement (IEA), Trends in International Mathematics and 
Science Study (TIMSS), 2007. 
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Scores within a subject and grade are comparable over time. 
The TIMSS scale was established originally to have a mean 
of 500 based on the average of all of the countries that 
participated in TIMSS 1995 at the fourth and eighth grades. 
Successive TIMSS assessments since then (TIMSS 1999, 
2003, and 2007) have scaled the achievement data so that 
scores are equivalent from assessment to assessment. 

That is, a score of 500 in eighth-grade mathematics in 2007 
is equivalent to a score of 500 in eighth-grade mathematics 
in 2003, in 1999, and in 1995. The same is true for the fourth- 
grade scale: a score of 500 in fourth-grade mathematics 
in 2007 is equivalent to a score of 500 in fourth-grade 
mathematics in 2003 and 1995. More information on how 
the TIMSS scale was created can be found in appendix A. 



Average scores in 2007 

The average mathematics scores for both U.S. fourth- and 
eighth-graders were higher than the TIMSS scale average 
(table 3). In 2007, the average score of U.S. fourth-graders 
was 529 and the average score of U.S. eighth-graders was 
508, compared with the TIMSS scale average of 500 at each 
grade level. 

At grade four, the average U.S. mathematics score was higher 
than those in 23 of the 35 other countries, lower than those in 
8 countries (all 8 were in Asia or Europe), and not measurably 
different from the average scores in the remaining 4 countries. 

At grade eight, the average U.S. mathematics score was higher 
than those in 37 of the 47 other countries, lower than those in 
5 countries (all of them located in Asia), and not measurably 
different from the average scores in the other 5 countries. 
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Table 3. Average mathematics scores 

of fourth- and eighth-grade students, 
by country: 2007 

Grade four Grade eight 





Average 


Country 


score 


TIMSS scale average 


500 




Hong Kong SAR 1 


607 


Singapore 


599 


Chinese Taipei 


576 


Japan 


568 


Kazakhstan 2 


549 


Russian Federation 


544 


England 


541 


Latvia 2 


537 


Netherlands 3 


535 


Lithuania 2 


530 


United States 45 


529 


Germany 


525 


Denmark 4 


523 


Australia 


516 


Hungary 


510 


Italy 


507 


Austria 


505 


Sweden 


503 


Slovenia 


502 


Armenia 


500 


Slovak Republic 


496 


Scotland 4 


494 


New Zealand 


492 


Czech Republic 


486 


Norway 


473 


Ukraine 


469 


Georgia 2 


438 


Iran, Islamic Rep. of 


402 


Algeria 


378 


Colombia 


355 


Morocco 


341 


El Salvador 


330 


Tunisia 


327 


Kuwait 6 


316 


Qatar 


296 


Yemen 


224 



Average 

Country score 

TIMSS scale average 500 



Chinese Taipei 
Korea, Rep. of 
Singapore 
Hong Kong SAR 14 
Japan 



Hungary 

England 4 

Russian Federation 
United States 4 5 
Lithuania 2 
Czech Republic 


517 

513 

512 

508 

506 

504 


Slovenia 


501 


Armenia 


499 


Australia 


496 


Sweden 


491 


Malta 


488 


Scotland 4 


487 


Serbia 2 ' 5 


486 


Italy 


480 


Malaysia 


474 


Norway 


469 


Cyprus 


465 


Bulgaria 


464 


Israel 7 


463 


Ukraine 


462 


Romania 


461 


Bosnia and Herzegovina 


456 


Lebanon 


449 


Thailand 


441 


Turkey 


432 


Jordan 


427 


Tunisia 


420 


Georgia 2 


410 


Iran, Islamic Rep. of 


403 


Bahrain 


398 


Indonesia 


397 


Syrian Arab Republic 


395 


Egypt 


391 


Algeria 


387 


Colombia 


380 


Oman 


372 


Palestinian Nat'IAuth. 


367 


Botswana 


364 


Kuwait 6 


354 


El Salvador 


340 


Saudi Arabia 


329 


Ghana 


309 


Qatar 


307 



598 

597 

593 

572 

570 



■ Average score is higher than U.S. average score (p < .05) 

Q Average score is not measurably different from the U.S. average score (p < .05) 

■ Average score is lower than the U.S. average score (p < .05) 

'Hong Kong is a Special Administrative Region (SAR) of the People's Republic 
of China. 

2 National Target Population does not include all of the International Target 
Population defined by the Trends in International Mathematics and Science 
Study (TIMSS) (see appendix A). 

3 Nearly satisfied guidelines for sample participation rates only after substitute 
schools were included (see appendix A). 

4 Met guidelines for sample participation rates only after substitute schools were 
included (see appendix A). 

5 National Defined Population covers 90 percent to 95 percent of National 
Target Population (see appendix A). 

6 Kuwait tested the same cohort of students as other countries, but later 
in 2007, at the beginning of the next school year. 

7 National Defined Population covers less than 90 percent of National Target 
Population (but at least 77 percent, see appendix A). 

NOTE: Countries are ordered by 2007 average score. The tests for significance 
take into account the standard error for the reported difference. Thus, a small 
difference between the United States and one country may be significant while 
a large difference between the United States and another country may not be 
significant. The standard errors of the estimates are shown in tables E-1 and E-2 
available at http://nces.ed. gov/pubsearch/pubsinfo.asp?pubid=2009001 . 
SOURCE: International Association for the Evaluation of Educational 
Achievement (IEA), Trends in International Mathematics and Science Study 
(TIMSS), 2007. 
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Trends in scores since 1995 

Several countries participated in both the first TIMSS in 1995 
and the most recent TIMSS in 2007 and therefore the average 
scores can be compared over a 12-year period. At grade four, 
16 countries, including the United States, participated in both 
the first and most recent TIMSS administrations. Comparing 
2007 mathematics scores with those from 1995, one-half of 
the countries (8 of 16), including the United States, showed 
improvement in average scores and one-quarter of the 
countries (4 of 16) showed declines (table 4). In 2007, the U.S. 
fourth-grade average mathematics score of 529 was 11 scale 
score points higher than the 1995 average of 518. 

The gain in the U.S. fourth-grade average mathematics score 
(11 scale score points) was greater than the difference in six 
countries (the four countries with declines in average scores, 



as well as two other countries) and less than the gain of four 
countries (England, Hong Kong SAR, Slovenia, and Latvia). 
There was no measurable difference between the 11 score 
point gain in the United States and the gains or declines in 
score points experienced in the other countries. 

At grade eight, 20 countries, including the United States, 
participated in TIMSS in both 1995 and 2007. About one- 
quarter of the countries (6 of 20), including the United States, 
had higher average mathematics scores in 2007 than in 1995 
and students in one-half of the countries (1 0 of 20) showed 
declines in their average scores. The U.S. eighth-grade 
average mathematics score of 508 was 16 scale score points 
higher than the 1995 average of 492. 

The gain in the U.S. eighth-grade mathematics score 
(16 scale score points) was greater than the difference 



Table 4. Trends in average mathematics scores of fourth- and eighth-grade students, by country: 
1995 to 2007 





Grade four 








Grade eight 








Average score 


Difference 1 




Average score 


Difference 1 


Country 


1995 


2007 


2007-1995 


Country 


1995 


2007 


2007-1995 


England 


484 


541 


57* 


Colombia 


332 


380 




Hong Kong SAR 2 


557 


607 


50* 


Lithuania 3 


472 


506 


Slovenia 


462 


502 


40* 


Korea, Rep. of 


581 


597 


17* 


Latvia 3 


499 


537 


38* 


United States 4 5 


492 


508 


16* 


New Zealand 


469 


492 


23* 


England 4 


498 


513 


16* 


Australia 


495 


516 


22* 


Slovenia 


494 


501 


7* 


Iran, Islamic Rep. of 


387 


402 


15* 


Hong Kong SAR 2 4 


569 


572 


4 


United States 45 


518 


529 


11* 


Cyprus 


468 


465 


-2 


Singapore 


590 


599 


9 


Scotland 4 


493 


487 


-6 


Scotland 4 


493 


494 


1 


Hungary 


527 


517 


-10* 


Japan 


567 


568 


1 


Japan 


581 


570 


-11* 


Norway 


476 


473 


-3 


Russian Federation 


524 


512 


-12 


Hungary 


521 


510 


-12* 


Romania 


474 


461 


-12* 


Netherlands 6 


549 


535 


-14* 


Australia 


509 


496 


-13* 


Austria 


531 


505 


-25* 


Iran, Islamic Rep. of 


418 


403 


-15* 


Czech Republic 


541 


486 


-54* 


Singapore 


609 


593 


-16* 










Norway 


498 


469 


-29* 










Czech Republic 


546 


504 


-42* 










Sweden 


540 


491 


-48* 










Bulgaria 


527 


464 


-63* 



■ Country difference in average scores between 1995 and 2007 is greater than analogous U.S. difference (p < .05) 

□ Country difference in average scores between 1995 and 2007 is not measurably different from analogous U.S. difference (p < .05) 

□ Country difference in average scores between 1995 and 2007 is less than analogous U.S. difference (p < .05) 

*p < .05. Within-country difference between 1995 and 2007 average scores is significant. 

difference calculated by subtracting 1995 from 2007 estimate using unrounded numbers. 

2 Hong Kong is a Special Administrative Region (SAR) of the People's Republic of China. 

3 ln 2007, National Target Population did not include all of the International Target Population defined by the Trends in International Mathematics and Science Study 
(TIMSS) (see appendix A). 

4 ln 2007, met guidelines for sample participation rates only after substitute schools were included (see appendix A). 

5 ln 2007, National Defined Population covered 90 percent to 95 percent of National Target Population (see appendix A). 

6 ln 2007, nearly satisfied guidelines for sample participation rates only after substitute schools were included (see appendix A). 

NOTE: Countries are ordered based on the difference in 1995 and 2007 average scores. All countries met international sampling and other guidelines in 2007, 
except as noted. Data are not shown for some countries, because comparable data from previous cycles are not available. The tests for significance take into 
account the standard error for the reported difference. Thus, a small difference between averages for one country may be significant while a large difference for 
another country may not be significant. Detail may not sum to totals because of rounding. The standard errors of the estimates are shown in tables E-1 and E-2 
available at http://nces.ed. gov/pubsearch/pubsinfo.asp?pubid=2009001 . 

SOURCE: International Association for the Evaluation of Educational Achievement (IEA), Trends in International Mathematics and Science Study (TIMSS), 1995 
and 2007. 
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in 13 countries (including the 10 countries with declining scores 
and 3 others) and less than the gain of 2 countries (Colombia 
and Lithuania). There was no measurable difference between 
the 16 score point gain in the United States and the gains 
or declines in score points experienced in the other countries. 



Figure 2. Difference between average 

mathematics scores of U.S. fourth- 
and eighth-grade students and the 
TIMSS scale average: 1995, 1999, 
2003, and 2007 



The size of the difference in scores between the U.S. fourth- 
graders’ and TIMSS scale averages was larger in 2007 at 29 
scale score points than it was in 1995 at 18 scale score points 
(figure 2). U.S. fourth-graders’ average mathematics scores 
were higher than the TIMSS scale average in each of the 3 
data collection years: 1995, 2003, and 2007. 

U.S. eighth-graders’ average mathematics scores showed no 
measurable difference from the TIMSS scale average in 3 of 
the 4 data collection years between 1995 and 2007. However, 
the 2007 U.S. score was higher than the U.S. score in 1995, 
with the U.S. score in 1995 some 8 points below the TIMSS 
scale average, but 8 points above the average in 2007. 



U.S. difference from TIMSS scale average 



Grade four 




-40- 



-60- 



-80 J 



Year 



U.S. difference from TIMSS scale average Grade eight 




*p < .05. Difference between U.S. average and Trends in International 
Mathematics and Science Study (TIMSS) scale average is statistically significant. 
'No fourth-grade assessment was conducted in 1999. 

NOTE: In 2007, the United States met guidelines for sample participation rates 
only after substitute schools were included. The National Defined Population 
covered 90 percent to 95 percent of National Target Population (see appendix A). 
Difference calculated by subtracting the TIMSS scale average (500) from the 
U.S. average mathematics score. The standard errors of the estimates are shown 
in table E-39 available at http://nces.ed.aov/pubsearch/pubsinfo.asp?pubid= 
2009001 . 

SOURCE: International Association for the Evaluation of Educational 
Achievement (IEA), Trends in International Mathematics and Science Study 
(TIMSS), 1 995, 1 999, 2003, and 2007. 
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Content and cognitive domain 
scores in 2007 

In addition to an overall mathematics score, TIMSS provides 
scores for content domains and cognitive domains (see table 
5 for a description of the cognitive domains). U.S. fourth- 
graders scored higher than the TIMSS scale average across 
the mathematics content domains in 2007 (table 6). U.S. 
fourth-graders’ average scores in number, geometric shapes 
and measures, and data display were between 22 and 43 
scale score points above the TIMSS scale average of 500 
in each content domain. 

U.S. fourth-graders performed better on average in the data 
display domain than in the number and geometric shapes 
and measures domains, at least in terms of comparisons 
with other countries. That is, there were fewer countries that 
outperformed the United States in data display than in the 
other two domains. U.S. fourth-graders outperformed their 
peers in 22 countries in the number domain, 20 countries 
in the geometric shapes and measures domain, and 28 
countries in the data display domain. They were outperformed 
by their peers in 9 countries in the number domain, 10 
countries in the geometric shapes and measures domain, 
and 4 countries in the data display domain. 



In the three cognitive domains, U.S. fourth-graders scored 
higher than the TIMSS scale average in 2007. U.S. fourth- 
graders’ average scores in the knowing, applying, and 
reasoning domains were between 23 and 41 scale score 
points higher than the TIMSS scale average of 500. 

In terms of comparisons with other countries, U.S. fourth- 
graders performed relatively better on average in the applying 
domain than the knowing and reasoning domains. U.S. fourth- 
graders outperformed students in 16 to 27 countries across 
the three cognitive domains and were outperformed by their 
peers in 5 to 11 countries across the three cognitive domains. 

At the eighth-grade level, U.S. students scored higher, on 
average, than the TIMSS scale average in two of the four 
mathematics content domains in 2007 (table 7). U.S. eighth- 
graders’ average scores in number and data and chance were 
10 and 31 scale score points above the TIMSS scale score 
average of 500, respectively. On the other hand, U.S. eighth- 
graders’ average score in the geometry domain was lower than 
the TIMSS scale score average by 20 scale score points. There 
was no measurable difference between U.S. eighth-graders’ 
average score in algebra and the TIMSS scale score average. 

U.S. eighth-graders performed relatively better, on average, 
in the data and chance domain than in the number, algebra, 



Table 5. Description of TIMSS mathematics cognitive domains: 2007 



Cognitive domain 


Description 


Knowing 


Knowing addresses the facts, procedures, and concepts that students need to know to function mathematically. The key skills of 
this cognitive domain include recalling definitions, terminology, number properties, geometric properties, and notation; recognizing 
mathematical objects, shapes, numbers, and expressions; recognizing mathematical entities that are mathematically equivalent; 
computing algorithmic procedures for basic functions with whole numbers, fractions, decimals, and integers; approximating 
numbers to estimate computations; carrying out routine algebraic procedures; retrieving information from graphs, tables, and 
charts; reading simple scales; using appropriate units of measure and measuring instruments; estimating measures; classifying 
or grouping objects, shapes, numbers, and expressions according to common properties; making correct decisions about class 
membership; and ordering numbers and objects by attributes. 


Applying 


Applying focuses on students’ abilities to apply knowledge and conceptual understanding to solve problems or answer questions. 
The key skills of this cognitive domain include selecting appropriate operations, methods, or strategies for solving problems where 
there is a known algorithm or method of solution; representing mathematics information and data in diagrams, tables, graphs, 
and charts; generating equivalent representations for a given mathematical entity or relationship; generating an appropriate 
mathematical model, such as an equation or diagram for solving a routine problem; following and executing a set of mathematical 
instructions; drawing figures and shapes given specifications; solving routine problems (i.e., problems similar to those students are 
likely to have encountered in class); comparing and matching different representations of data (grade eight) and using data from 
charts, tables, graphs, and maps to solve routine problems. 


Reasoning 


Reasoning goes beyond the cognitive processes involved in solving routine problems to include unfamiliar situations, complex 
contexts, and multistep problems. The key skills of this cognitive domain include determining and describing relationships between 
variables or objects in mathematical situations; using proportional reasoning (grade four); decomposing geometric figures to simplify 
solving a problem; drawing the net of a given unfamiliar solid; visualizing transformations of three-dimensional figures; comparing 
and matching different representations of the same data (grade four); making valid inferences from given information; generalizing 
mathematical results to wider applications; combining mathematical procedures to establish results and combining results to 
produce a further result; making connections between different elements of knowledge and related representations; making linkages 
between different elements of knowledge and related representations; making linkages between related mathematical ideas; 
providing a justification for the truth or falsity of a statement by reference to mathematical results or properties; solving problems 
set in mathematical or real life contexts that students are unlikely to have encountered before; applying mathematical procedures 
in unfamiliar or complex contexts; and using geometric properties to solve non-routine problems. 



NOTE: The descriptions of the cognitive domains are the same for grades four and eight, except where noted. 

SOURCE: International Association for the Evaluation of Educational Achievement (IEA), Trends in International Mathematics and Science Study (TIMSS), 2007. 
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